SMAP: a streamlined methylation analysis pipeline for bisulfite sequencing

نویسندگان

  • Shengjie Gao
  • Dan Zou
  • Likai Mao
  • Quan Zhou
  • Wenlong Jia
  • Yi Huang
  • Shancen Zhao
  • Gang Chen
  • Song Wu
  • Dongdong, Li
  • Fei Xia
  • Huafeng Chen
  • Maoshan Chen
  • Torben F Ørntoft
  • Lars Bolund
  • Karina D Sørensen
چکیده

BACKGROUND DNA methylation has important roles in the regulation of gene expression and cellular specification. Reduced representation bisulfite sequencing (RRBS) has prevailed in methylation studies due to its cost-effectiveness and single-base resolution. The rapid accumulation of RRBS data demands well designed analytical tools. FINDINGS To streamline the data processing of DNA methylation from multiple RRBS samples, we present a flexible pipeline named SMAP, whose features include: (i) handling of single-and/or paired-end diverse bisulfite sequencing data with reduced false-positive rates in differentially methylated regions; (ii) detection of allele-specific methylation events with improved algorithms; (iii) a built-in pipeline for detection of novel single nucleotide polymorphisms (SNPs); (iv) support of multiple user-defined restriction enzymes; (v) conduction of all methylation analyses in a single-step operation when well configured. CONCLUSIONS Simulation and experimental data validated the high accuracy of SMAP for SNP detection and methylation identification. Most analyses required in methylation studies (such as estimation of methylation levels, differentially methylated cytosine groups, and allele-specific methylation regions) can be executed readily with SMAP. All raw data from diverse samples could be processed in parallel and 'packetized' streams. A simple user guide to the methylation applications is also provided.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SAAP-RRBS: streamlined analysis and annotation pipeline for reduced representation bisulfite sequencing

UNLABELLED Reduced representation bisulfite sequencing (RRBS) is a cost-effective approach for genome-wide methylation pattern profiling. Analyzing RRBS sequencing data is challenging and specialized alignment/mapping programs are needed. Although such programs have been developed, a comprehensive solution that provides researchers with good quality and analyzable data is still lacking. To addr...

متن کامل

Methy-Pipe: An Integrated Bioinformatics Pipeline for Whole Genome Bisulfite Sequencing Data Analysis

DNA methylation, one of the most important epigenetic modifications, plays a crucial role in various biological processes. The level of DNA methylation can be measured using whole-genome bisulfite sequencing at single base resolution. However, until now, there is a paucity of publicly available software for carrying out integrated methylation data analysis. In this study, we implemented Methy-P...

متن کامل

VM : a virtual machine for the integral analysis of bisulfite sequencing data

The analysis of whole genome DNA methylation patterns is an important first step towards the understanding on how DNA methylation is involved in the regulation of gene expression and genome stability. Previously, we published MethylExtract, a program for DNA methylation profiling and genotyping from the same sample. Over the last years we developed it further into a methylation analysis pipelin...

متن کامل

Computational Analysis of Genome-Wide ARGONAUTE-Dependent DNA Methylation in Plants.

Whole-genome bisulfite sequencing (WGBS) has become a powerful tool to dissect genome-wide methylation profiles at single-base resolution. In this chapter we describe in detail the bioinformatics pipeline used for the analysis of ARGONAUTE-dependent DNA methylation in Arabidopsis thaliana. We provide tools and command lines used for mapping bisulfite sequencing reads, for estimating methylation...

متن کامل

Detection of significantly differentially methylated regions in targeted bisulfite sequencing data

MOTIVATION Bisulfite sequencing is currently the gold standard to obtain genome-wide DNA methylation profiles in eukaryotes. In contrast to the rapid development of appropriate pre-processing and alignment software, methods for analyzing the resulting methylation profiles are relatively limited so far. For instance, an appropriate pipeline to detect DNA methylation differences between cancer an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 4  شماره 

صفحات  -

تاریخ انتشار 2015